Abstract

In statistical problems, a set of parameterized probability distributions
is used to estimate the true probability distribution. If the Fisher information
matrix at the true distribution is singular, then it has been unknown
what can be estimated about the true distribution from random samples.
In this paper, we study a singular regression problem and prove a limit
theorem which shows the relation between the singular regression problem
and two birational invariants, a real log canonical threshold and a singular
fluctuation. The obtained theorem has an important application to statistics,
because it enables us to estimate the generalization error from the
training error without any knowledge of the true probability distribution.

1 Introduction 

Let $M$ and $N$ be natural numbers, and let $\mathbb{R}^M$ and $\mathbb{R}^N$ be the $M$- and $N$-dimensional
real Euclidean spaces respectively. Assume that $(\Omega, \mathcal{B}, P)$ is a probability space
and that $(X, Y)$ is an $\mathbb{R}^M \times \mathbb{R}^N$-valued random variable which is subject to a
simultaneous probability density function,

$$q(x,y) = \frac{q(x)}{(2\pi\sigma^2)^{N/2}} \exp\Big(-\frac{|y - r_0(x)|^2}{2\sigma^2}\Big),$$

where $q(x)$ is a probability density function on $\mathbb{R}^M$, $\sigma > 0$ is a constant, $r_0(x)$
is a measurable function from $\mathbb{R}^M$ to $\mathbb{R}^N$, and $|\cdot|$ is the Euclidean norm of
$\mathbb{R}^N$. The function $r_0(x)$ is called a regression function of $q(x,y)$. Assume that
$\{(X_i, Y_i);\ i = 1, 2, \ldots, n\}$ is a set of random variables which are independently
subject to the same probability distribution as $(X, Y)$. Let $W$ be a subset of
$\mathbb{R}^d$. Let $r(x,w)$ be a function from $\mathbb{R}^M \times W$ to $\mathbb{R}^N$. The square error $H(w)$ is
a real function on $W$,

$$H(w) = \frac{1}{2}\sum_{i=1}^{n} |Y_i - r(X_i, w)|^2.$$
An expectation operator $E_w[\ ]$ on $W$ is defined by

$$E_w[F(w)] = \frac{\int F(w)\exp(-\beta H(w))\varphi(w)\,dw}{\int \exp(-\beta H(w))\varphi(w)\,dw}, \tag{1}$$

where $F(w)$ is a measurable function, $\varphi(w)$ is a probability density function on
$W$, and $\beta > 0$ is a constant called an inverse temperature. Note that $E_w[F(w)]$
is not a constant but a random variable because $H(w)$ depends on random
variables. Two random variables $G$ and $T$ are defined by

$$G = \frac{1}{2} E_X E_Y\big[\,|Y - E_w[r(X,w)]|^2\,\big],$$

$$T = \frac{1}{2n}\sum_{i=1}^{n} |Y_i - E_w[r(X_i,w)]|^2.$$

These random variables $G$ and $T$ are called the generalization and training errors
respectively. Since $E_{X,Y}[|Y - r_0(X)|^2] = N\sigma^2$, it is expected under some natural
conditions that both $E[G]$ and $E[T]$ converge to $S = N\sigma^2/2$ when $n$ tends to
infinity if there exists $w_0 \in W$ such that $r(x,w_0) = r_0(x)$. In this paper, we
ask how fast these convergences are; in other words, our study concerns
a limit theorem which shows the convergences of $n(E[G] - S)$ and $n(E[T] - S)$,
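when $n \to \infty$. If the Fisher information matrix

$$I_{ij}(w) = \int \partial_i r(x,w)\cdot\partial_j r(x,w)\,q(x)\,dx,$$

where $\partial_i = (\partial/\partial w_i)$, is positive definite for arbitrary $w \in W$, then this problem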
is well known as a regular regression problem. In fact, in a regular regression
problem, the convergences $n(E[G] - S) \to d\sigma^2/2$ and $n(E[T] - S) \to -d\sigma^2/2$ hold.

However, if $I(w_0) = \{I_{ij}(w_0)\}$ is singular, that is to say, if $\det I(w_0) = 0$,
then the problem is called a singular regression problem, and the convergences of
$n(E[G] - S)$ and $n(E[T] - S)$ have been left unknown.

In general it has been difficult to study a limit theorem for the case when
the Fisher information matrix is singular. However, recently, we have shown that a
limit theorem can be established based on resolution of singularities, and that
there are mathematical relations between the limit theorem and two birational
invariants in singular density estimation [16], [17], [18]. In this paper we prove
a new limit theorem for the singular regression problem, which enables us to
estimate birational invariants from random samples. The limit theorem proved
in this paper has an important application to statistics, because the expectation
value of the generalization error $E[G]$ can be estimated from that of the training
error $E[T]$ without any knowledge of the true probability distribution.

Example. Let $M = N = 1$, $d = 4$, $w = (a, b, c, d)$, and $W = \{w \in \mathbb{R}^4;\ |w| \le 1\}$.
If the function $r(x, w)$ is defined by

$$r(x,w) = a\sin(bx) + c\sin(dx),$$

and $r_0(x) = 0$, then the set $\{w \in W;\ r(x,w) = r_0(x)\}$ is not one point,
and the Fisher information matrix at $(a,b,c,d) = (0,0,0,0)$ is singular. Many
functions used in statistics, information science, brain informatics, and
bioinformatics are singular, for example, artificial neural networks, radial basis
functions, and wavelet functions.
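The Example can be checked numerically. The following is a minimal sketch, not part of the original argument: it averages the outer product of the gradient of r(x, w) = a sin(bx) + c sin(dx) over a fixed grid of inputs. The grid `xs` is an assumption standing in for the unspecified density q(x), and the helper names (`grad_r`, `fisher_matrix`, `zero_set_samples`) are hypothetical.

```python
import math

def r(x, w):
    """Regression function of the Example: r(x, w) = a*sin(b*x) + c*sin(d*x)."""
    a, b, c, d = w
    return a * math.sin(b * x) + c * math.sin(d * x)

def grad_r(x, w):
    """Partial derivatives of r with respect to (a, b, c, d)."""
    a, b, c, d = w
    return [math.sin(b * x), a * x * math.cos(b * x),
            math.sin(d * x), c * x * math.cos(d * x)]

def fisher_matrix(w, xs):
    """Average of grad_r grad_r^T over xs (assumed stand-in for the integral against q(x))."""
    mat = [[0.0] * 4 for _ in range(4)]
    for x in xs:
        g = grad_r(x, w)
        for i in range(4):
            for j in range(4):
                mat[i][j] += g[i] * g[j] / len(xs)
    return mat

# Assumption: inputs on a uniform grid over [-2, 2]; the paper leaves q(x) general.
xs = [-2.0 + 0.01 * i for i in range(401)]

# At the origin every partial derivative vanishes, so the matrix is identically zero.
fisher_at_origin = fisher_matrix((0.0, 0.0, 0.0, 0.0), xs)

# Whenever a = 0, the derivative with respect to b vanishes, so row b is zero.
fisher_a_zero = fisher_matrix((0.0, 0.5, 0.3, 0.7), xs)

# Several distinct parameters in W that all realize r(x, w) = r_0(x) = 0.
zero_set_samples = [(0.0, 0.4, 0.0, -0.6), (0.3, 0.5, -0.3, 0.5), (0.2, 0.0, 0.1, 0.0)]
```

The zero row in `fisher_a_zero` shows that singularity is not confined to the origin: the matrix degenerates on the whole set where a = 0 (and likewise where c = 0).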

2 Main Results 

We prove the main theorems based on the following assumptions. 
Basic Assumptions. 

(A1) The set of parameters $W$ is defined by

$$W = \{w \in \mathbb{R}^d;\ \pi_j(w) \ge 0\ (j = 1, 2, \ldots, k)\},$$

where $\pi_j(w)$ is a real analytic function. It is assumed that $W$ is a compact set
in $\mathbb{R}^d$ whose open kernel is not the empty set. The probability density function
$\varphi(w)$ on $W$ is given by

$$\varphi(w) = \varphi_1(w)\varphi_2(w),$$

where $\varphi_1(w) \ge 0$ is a real analytic function and $\varphi_2(w) > 0$ is a function of class $C^\infty$.

(A2) Let $s \ge 8$ be a number that is equal to 4 times some integer. There
exists an open set $W^* \supset W$ such that $r(x,w) - r_0(x)$ is an $L^s(q)$-valued analytic
function on $W^*$, where $L^s(q)$ is a Banach space defined by using its norm $|\ |_s$,

$$L^s(q) = \Big\{f;\ |f|_s \equiv \Big(\int |f(x)|^s q(x)\,dx\Big)^{1/s} < \infty\Big\}.$$

(A3) There exists a parameter $w_0 \in W$ such that $r(x,w_0) = r_0(x)$.
If these basic assumptions are satisfied, then

$$K(w) = \frac{1}{2}\int |r(x,w) - r_0(x)|^2 q(x)\,dx \tag{2}$$

is a real analytic function on $W^*$. A subset $W_a \subset W$ is defined by

$$W_a = \{w \in W;\ K(w) \le a\}.$$

Note that $W_0$ is the set of all points that satisfy $K(w) = 0$. In general, $W_0$ is
not one point and it contains singularities. This paper gives a limit theorem for
such a case. Proofs of lemmas and theorems in this section are given in section 6.

Lemma 1. Assume (A1), (A2), and (A3) with $s \ge 4$. Then

$$\zeta(z) = \int_W K(w)^z \varphi(w)\,dw$$

is a holomorphic function on $\mathrm{Re}(z) > 0$ which can be analytically continued to
the unique meromorphic function on the entire complex plane whose poles are
all real, negative, and rational numbers.

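Lemma 2. Assume (A1), (A2), and (A3) with $s \ge 8$. Then there exists a
constant $\nu = \nu(\beta) > 0$ such that

$$V = \sum_{i=1}^{n} \Big\{ E_w[\,|r(X_i,w)|^2\,] - |E_w[r(X_i,w)]|^2 \Big\}$$

satisfies

$$\lim_{n\to\infty} E[V] = \frac{2\nu}{\beta}. \tag{3}$$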

Based on Lemma 1 and 2, we define two important values $\lambda, \nu > 0$.

Definition 2.1. Let the largest pole of $\zeta(z)$ be $(-\lambda)$ and its order $m$. The
constant $\lambda > 0$ is called a real log canonical threshold. The constant $\nu = \nu(\beta)$
is referred to as a singular fluctuation.

The real log canonical threshold is an important invariant of an analytic set
$K(w) = 0$. For its relation to algebraic geometry and algebraic analysis, see
[H m El ISl Uni E]. It is also important in statistical learning theory, and it can
be calculated by resolution of singularities [THl [3]. The singular fluctuation is
an invariant of $K(w) = 0$ which is found in statistical learning theory [15], [18],
and whose relation to singularity theory is still unknown. The following are the main
theorems of this paper.

Theorem 1. Assume the basic assumptions (A1), (A2), and (A3) with $s \ge 8$.
Let $S = N\sigma^2/2$. Then

$$\lim_{n\to\infty} n(E[G] - S) = \frac{\lambda - \nu}{\beta} + \nu\sigma^2, \tag{4}$$

$$\lim_{n\to\infty} n(E[T] - S) = \frac{\lambda - \nu}{\beta} - \nu\sigma^2. \tag{5}$$

This theorem shows that both the real log canonical threshold $\lambda$ and the singular
fluctuation $\nu$ determine the singular regression problem.
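As a short worked consequence, using only the symbols of Theorem 1: subtracting the training-error limit from the generalization-error limit removes the real log canonical threshold, and substituting the regular values $\lambda = \nu = d/2$ recovers the regular limits quoted in the Introduction.

```latex
\begin{align*}
\lim_{n\to\infty} n\,\bigl(E[G] - E[T]\bigr)
  &= \Bigl(\frac{\lambda-\nu}{\beta} + \nu\sigma^2\Bigr)
   - \Bigl(\frac{\lambda-\nu}{\beta} - \nu\sigma^2\Bigr)
   = 2\nu\sigma^2, \\[4pt]
\lambda = \nu = \frac{d}{2}
  \;\Longrightarrow\;
  \lim_{n\to\infty} n\,\bigl(E[G] - S\bigr) &= \frac{d\sigma^2}{2},
  \qquad
  \lim_{n\to\infty} n\,\bigl(E[T] - S\bigr) = -\frac{d\sigma^2}{2}.
\end{align*}
```

The first line is the reason the gap between generalization and training errors can be read off from the singular fluctuation alone.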

Theorem 2. Assume the basic assumptions (A1), (A2), and (A3) with $s \ge 12$.
Then

$$E[G] = E\Big[\Big(1 + \frac{2\beta V}{nN}\Big) T\Big] + o_n,$$

where $o_n$ is a function of $n$ which satisfies $n\,o_n \to 0$.

Both $V$ and $T$ can be calculated from random samples without
any direct knowledge of the true regression function $r_0(x)$. Therefore, by this theorem, $E[G]$
can be estimated from random samples, so that we can find the optimal
model or hyperparameter for the smallest generalization error. If the model is
regular, then $\lambda = \nu = d/2$ for arbitrary $0 < \beta < \infty$, so that Theorem 2
coincides with AIC [1] of a regular statistical model. Therefore, Theorem 2 is
a widely applicable information criterion, which we can apply to both regular
and singular problems. We can use Theorem 2 without checking whether the true
distribution is a singularity or not.
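To make the recipe concrete, here is a minimal sketch of how T, V, and the resulting estimate of the generalization error might be computed for scalar outputs (N = 1). It is an illustration under stated assumptions, not the author's procedure: the posterior average E_w[ ] is approximated by self-normalized weights exp(-βH(w)) over a finite list `ws` of parameters assumed to be drawn from the prior φ(w); the estimate (1 + 2βV/(nN))T is our reading of Theorem 2; and the data `xs`, `ys`, the grid `ws`, and all function names are hypothetical.

```python
import math

def posterior_weights(xs, ys, ws, beta, r):
    """Normalized weights proportional to exp(-beta * H(w)), H(w) = (1/2) sum (y_i - r(x_i, w))^2."""
    hs = [0.5 * sum((y - r(x, w)) ** 2 for x, y in zip(xs, ys)) for w in ws]
    h_min = min(hs)  # subtract the minimum before exponentiating, for numerical stability
    raw = [math.exp(-beta * (h - h_min)) for h in hs]
    total = sum(raw)
    return [v / total for v in raw]

def training_error_and_fluctuation(xs, ys, ws, beta, r):
    """Return (T, V): the training error and the summed posterior variance of r(X_i, w)."""
    n = len(xs)
    p = posterior_weights(xs, ys, ws, beta, r)
    T = 0.0
    V = 0.0
    for x, y in zip(xs, ys):
        vals = [r(x, w) for w in ws]
        mean = sum(pi * v for pi, v in zip(p, vals))          # E_w[r(X_i, w)]
        second = sum(pi * v * v for pi, v in zip(p, vals))    # E_w[r(X_i, w)^2]
        T += (y - mean) ** 2 / (2 * n)
        V += second - mean * mean
    return T, V

def generalization_estimate(T, V, beta, n, N=1):
    """Estimate of E[G] in the form (1 + 2*beta*V/(n*N)) * T (assumed reading of Theorem 2)."""
    return (1.0 + 2.0 * beta * V / (n * N)) * T

def model(x, w):
    """The model of the Example in the Introduction."""
    a, b, c, d = w
    return a * math.sin(b * x) + c * math.sin(d * x)

# Made-up data and a small parameter grid, for illustration only.
xs = [-1.0, -0.5, 0.0, 0.5, 1.0]
ys = [0.1, -0.2, 0.05, 0.15, -0.1]
ws = [(a, b, c, d) for a in (-0.5, 0.0, 0.5) for b in (0.2, 0.8)
      for c in (-0.5, 0.0, 0.5) for d in (0.3, 0.6)]
beta = 1.0

T, V = training_error_and_fluctuation(xs, ys, ws, beta, model)
estimate = generalization_estimate(T, V, beta, len(xs))
```

Because V is a sum of variances it is nonnegative, so the estimate never falls below the training error; with a single parameter value the variance term vanishes and the estimate reduces to T. In practice the finite list `ws` would be replaced by a proper sampler for the posterior.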



3 Preparation of Proof 

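We use notations, $S = N\sigma^2/2$ and

$$S_i = Y_i - r_0(X_i),$$

$$f(x,w) = r(x,w) - r_0(x).$$

Then $\{S_i\}$ are independent random variables which are subject to the normal
distribution with average zero and covariance matrix $\sigma^2 I$ where $I$ is the $N \times N$
identity matrix. It is immediately derived that

$$E[T] = S - E\Big[\frac{1}{n}\sum_{i=1}^{n} S_i \cdot E_w[f(X_i,w)]\Big] + E\Big[\frac{1}{2n}\sum_{i=1}^{n} |E_w[f(X_i,w)]|^2\Big],$$

$$E[G] = S + \frac{1}{2} E\big[E_X[\,|E_w[f(X,w)]|^2\,]\big],$$

$$E[V] = E\Big[\sum_{i=1}^{n} \big\{ E_w[\,|f(X_i,w)|^2\,] - |E_w[f(X_i,w)]|^2 \big\}\Big].$$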

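The function $f(x,w)$ is an $L^s(q)$-valued analytic function on $W^*$. Since $H(w)$ differs from

$$H_0(w) = \frac{1}{2}\sum_{i=1}^{n} |f(X_i,w)|^2 - \sum_{i=1}^{n} S_i \cdot f(X_i,w)$$

only by the term $\frac{1}{2}\sum_i |S_i|^2$, which does not depend on $w$, in eq. (1) we
can define $E_w[\ ]$ by replacing $H(w)$ by $H_0(w)$, which can be rewritten as

$$H_0(w) = nK(w) - \sqrt{n}\,\eta_n(w),$$

where $K(w)$ is given in eq. (2), and

$$\eta_n(w) = \eta_n^{(1)}(w) + \eta_n^{(2)}(w),$$

$$\eta_n^{(1)}(w) = \frac{1}{\sqrt{n}}\sum_{i=1}^{n} S_i \cdot f(X_i,w),$$

$$\eta_n^{(2)}(w) = \frac{1}{\sqrt{n}}\sum_{i=1}^{n} \Big(K(w) - \frac{1}{2}|f(X_i,w)|^2\Big).$$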

^ i—1 
We define a norm $\|\ \|$ of a function $f$ on $W$ by

$$\|f\| = \sup_{w \in W} |f(w)|.$$

Since $W$ is a compact set of $\mathbb{R}^d$, the set $B(W)$ of all continuous
and bounded functions on $W$ is a Polish space, and both $\eta_n^{(1)}(w)$ and $\eta_n^{(2)}(w)$ are
$B(W)$-valued random variables. Because $f(X,w)$ is an $L^s(q)$-valued analytic
function, $\{\eta_n^{(1)}\}$ and $\{\eta_n^{(2)}\}$ are tight random processes, so that $\eta_n^{(1)}$ and
$\eta_n^{(2)}$ weakly converge to unique tight gaussian processes $\eta^{(1)}$ and $\eta^{(2)}$ respectively
which have the same covariance matrices as $\eta_n^{(1)}$ and $\eta_n^{(2)}$ respectively when
$n \to \infty$ [niniiiH].

Lemma 3. Assume (A1), (A2), and (A3) with $s \ge 8$. Then

$$E[\|\eta_n^{(1)}\|^{s}] < \infty,$$
$$E[\|\eta_n^{(2)}\|^{s/2}] < \infty.$$

Proof. Since $f(x,w)$ is an $L^s(q)$-valued analytic function, it is represented by
the absolutely convergent power series $f(x,w) = \sum_j a_j(x) w^j$ which satisfies
$|a_j(x)| \le M(x)/r^j$ for some function $M(x) \in L^s(q)$, where $r = (r_1, \ldots, r_d)$ is the
associative convergence radii. By using this fact, the former inequality is proved
[T71II3. Also, since $K(w) - (1/2)|f(x,w)|^2$ is an $L^{s/2}(q)$-valued analytic function, the
latter inequality is proved. □

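Lemma 4. For arbitrary natural number $n$,

$$E[E_w[\sqrt{n}\,\eta_n^{(1)}(w)]] = \beta\sigma^2 E\Big[\sum_{i=1}^{n} \big\{ E_w[\,|f(X_i,w)|^2\,] - |E_w[f(X_i,w)]|^2 \big\}\Big],$$

$$E[E_w[\sqrt{n}\,\eta_n^{(2)}(w)]] = E\Big[E_w\Big[nK(w) - \frac{1}{2}\sum_{i=1}^{n} |f(X_i,w)|^2\Big]\Big].$$

Proof. The second equation is trivial. Let us prove the first equation. Let the
left hand side of the first equation be $A$. Since $\{S_i\}$ are independently subject
to the normal distribution with covariance matrix $\sigma^2 I$, integration by parts with
respect to each $S_i$ gives

$$A = E\Big[\sum_{i=1}^{n} S_i \cdot E_w[f(X_i,w)]\Big]$$

$$= \sigma^2 E\Big[\sum_{i=1}^{n} \nabla_{S_i} \cdot E_w[f(X_i,w)]\Big]$$

$$= \sigma^2 E\Big[\sum_{i=1}^{n} \nabla_{S_i} \cdot \frac{\int f(X_i,w)\exp(-\beta H_0(w))\varphi(w)\,dw}{\int \exp(-\beta H_0(w))\varphi(w)\,dw}\Big]$$

$$= \beta\sigma^2 E\Big[\sum_{i=1}^{n} \big\{ E_w[\,|f(X_i,w)|^2\,] - |E_w[f(X_i,w)]|^2 \big\}\Big],$$

which is equal to the right hand side of the first equation. □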
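The key step in this proof is the Gaussian integration-by-parts identity E[S g(S)] = σ² E[g'(S)] for S distributed as N(0, σ²). The sketch below checks the scalar version of that identity by numerical quadrature. It is only an illustration: the test function tanh, the value of `sigma`, and the helper `gaussian_expectation` are arbitrary choices that do not appear in the paper.

```python
import math

def gaussian_expectation(func, sigma, half_width=10.0, steps=20000):
    """Trapezoidal approximation of E[func(S)] for S ~ N(0, sigma^2) on [-10 sigma, 10 sigma]."""
    lo = -half_width * sigma
    hi = half_width * sigma
    dx = (hi - lo) / steps
    norm = math.sqrt(2.0 * math.pi * sigma ** 2)
    total = 0.0
    for i in range(steps + 1):
        s = lo + i * dx
        weight = 0.5 if i in (0, steps) else 1.0   # trapezoid end-point weights
        density = math.exp(-s * s / (2.0 * sigma ** 2)) / norm
        total += weight * func(s) * density * dx
    return total

sigma = 0.7

# Left side: E[S g(S)] with g = tanh.
lhs = gaussian_expectation(lambda s: s * math.tanh(s), sigma)

# Right side: sigma^2 E[g'(S)] with g'(s) = 1 - tanh(s)^2.
rhs = sigma ** 2 * gaussian_expectation(lambda s: 1.0 - math.tanh(s) ** 2, sigma)

# Sanity check that the quadrature integrates the density itself to one.
mass = gaussian_expectation(lambda s: 1.0, sigma)
```

In the lemma the role of g is played by the posterior mean E_w[f(X_i, w)], whose derivative with respect to S_i is β times the posterior variance, which is where the factor βσ² comes from.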
Definition 3.1. Let us define five random variables.

$$D_1 = nE_w[E_X[\,|f(X,w)|^2\,]],$$
$$D_2 = nE_X[\,|E_w[f(X,w)]|^2\,],$$
$$D_3 = \sum_{i=1}^{n} E_w[\,|f(X_i,w)|^2\,],$$
$$D_4 = \sum_{i=1}^{n} |E_w[f(X_i,w)]|^2,$$
$$D_5 = E_w[\sqrt{n}\,\eta_n(w)].$$

Then, by using Lemma 4, it follows that

$$E[G] = S + \frac{1}{2n}E[D_2], \tag{6}$$
$$E[T] = S - \frac{\sigma^2\beta}{n}E[D_3 - D_4] + \frac{1}{2n}E[D_4], \tag{7}$$
$$E[V] = E[D_3 - D_4], \tag{8}$$
$$E[D_5] = \beta\sigma^2 E[D_3 - D_4] + \frac{1}{2}E[D_1 - D_3]. \tag{9}$$

We show that the five expectation values $E[D_j]$ $(j = 1,2,3,4,5)$ converge to
constants. To show such convergences, it is sufficient to prove that each $D_j$
weakly converges to some random variable and that $E[(D_j)^{1+\delta}] < C$ for some
$\delta > 0$ and constant $C > 0$ [13].

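Definition 3.2. For a given constant $\epsilon > 0$, a localized expectation operator
$E_w^\epsilon[\ ]$ is defined by

$$E_w^\epsilon[F(w)] = \frac{\int_{K(w)\le\epsilon} F(w)\exp(-\beta H_0(w))\varphi(w)\,dw}{\int_{K(w)\le\epsilon} \exp(-\beta H_0(w))\varphi(w)\,dw}. \tag{10}$$

Let $D_j^\epsilon$ $(j = 1,2,3,4,5)$ be random variables that are defined by replacing $E_w[\ ]$
by $E_w^\epsilon[\ ]$.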

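Lemma 5. Let $0 < \delta < s/4 - 1$. For arbitrary $\epsilon > 0$, $j = 1, 2, 3, 4, 5$,

$$\lim_{n\to\infty} E[\,|D_j - D_j^\epsilon|^{1+\delta}\,] = 0.$$

Proof. We can prove the five equations in the same way. Let us prove the case
$j = 3$. Let $L(w) = \sum_{i=1}^{n} |f(X_i,w)|^2$. Because $f(x,w)$ is an $L^s(q)$-valued analytic
function, $E[(\|L\|/n)^{1+\delta}] < \infty$. Then

$$|D_3 - D_3^\epsilon| \le \frac{\int_{K(w)>\epsilon} L(w)\exp(-\beta H_0(w))\varphi(w)\,dw}{\int_{K(w)\le\epsilon} \exp(-\beta H_0(w))\varphi(w)\,dw}$$

$$\le \frac{\|L\|\, e^{-n\beta\epsilon + 2\beta\sqrt{n}\|\eta_n\|}}{\int_{K(w)\le\epsilon} \exp(-\beta nK(w))\varphi(w)\,dw}$$

$$\le C_1\, n^{d/2}\,\|L\| \exp\big(-n\beta\epsilon/2 + (2\beta/\epsilon)\|\eta_n\|^2\big),$$

where we used $2\sqrt{n}\|\eta_n\| \le (n\epsilon/2 + (2/\epsilon)\|\eta_n\|^2)$ and $C_1 > 0$ is a constant. From
Lemma 3, $E[\|\eta_n\|^{s/2}] = C_2 < \infty$, hence by using $C_3 = (8\epsilon^2)^{s/4} C_2$,

$$P(\|\eta_n\|^2 > n/(8\epsilon^2)) \le C_3/n^{s/4}.$$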

Let $E[F(x)]_A$ be the expectation value of $F(x)I_A(x)$ where $I_A(x)$ is the defining
function of a set $A$, in other words, $I_A(x) = 1$ if $x \in A$ and $0$ otherwise. Then

$$E[\,|D_3 - D_3^\epsilon|^{1+\delta}\,] = E[\,|D_3 - D_3^\epsilon|^{1+\delta}\,]_{\{\|\eta_n\|^2 > n/(8\epsilon^2)\}} + E[\,|D_3 - D_3^\epsilon|^{1+\delta}\,]_{\{\|\eta_n\|^2 \le n/(8\epsilon^2)\}}.$$

The first term of the right hand side is not larger than $C_3 E[\|L\|^{1+\delta}]/n^{s/4}$ and
the second term is not larger than $E[(C_1\|L\|)^{1+\delta}]\, n^{d/2} \exp(-n\beta\epsilon/4)$. Both of
them converge to zero. □



4 Resolution of Singularities 

To study the expectation on the region $W_\epsilon$ we need resolution of singularities
because $W_0$ contains singularities in general. Let $\epsilon > 0$ be a sufficiently small
constant. Then by applying Hironaka's theorem [7] to the real analytic function
$K(w)\varphi_1(w)\prod_{j=1}^{k}\pi_j(w)$, all functions $K(w)$, $\pi_j(w)$, and $\varphi_1(w)$ are made normal
crossing. In fact, there exist an open set $W_\epsilon^* \subset W^*$ which contains $W_\epsilon$, a
manifold $U^*$, and a proper analytic map $g : U^* \to W_\epsilon^*$ such that in each local
coordinate of $U^*$,

$$K(g(u)) = u^{2k},$$
$$\varphi(g(u))\,|g'(u)| = \phi(u)\,|u^h|,$$

where $k = (k_1, \ldots, k_d)$ and $h = (h_1, \ldots, h_d)$ are multi-indices ($k_j$ and $h_j$ are
nonnegative integers), $u^{2k} = \prod_j u_j^{2k_j}$, $u^h = \prod_j u_j^{h_j}$, $|g'(u)|$ is the absolute value
of the Jacobian determinant of $w = g(u)$, and $\phi(u) > 0$ is a function of class $C^\infty$.
Let $U = g^{-1}(W_\epsilon)$. Since $g$ is a proper map and $W_\epsilon$ is compact, $U$ is also
compact. Moreover, it is covered by a finite sum

$$U = \bigcup_\alpha U_\alpha,$$
where each $U_\alpha$ can be taken to be $[0,b]^d$ in each local coordinate using some
$b > 0$, and

$$\int_{W_\epsilon} F(w)\varphi(w)\,dw = \sum_\alpha \int_{U_\alpha} F(g(u))\,\phi_\alpha(u)\,|u^h|\,du,$$

where $\phi_\alpha(u) > 0$ is a function of class $C^\infty$. In this paper, we apply these facts
to analyzing the singular regression problem. For resolution of singularities and
its applications, see [7] and [HIHIIIS]. Lemma 1 is directly proved by these facts.
Moreover, the following lemma is simultaneously obtained.

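Lemma 6. The largest pole $(-\lambda)$ and its order $m$ of $\zeta(z)$ are given by

$$\lambda = \min_\alpha \min_{1\le j\le d} \frac{h_j + 1}{2k_j}, \tag{11}$$

$$m = \max_\alpha \#\Big\{j;\ \lambda = \frac{h_j + 1}{2k_j}\Big\}, \tag{12}$$

where, if $k_j = 0$, $(h_j + 1)/2k_j$ is defined to be $+\infty$ and $\#$ shows the number
of elements of the set. Let $\{U_{\alpha^*}\}$ be the set of all local coordinates that attain
both $\min_\alpha$ in eq. (11) and $\max_\alpha$ in eq. (12). Such coordinates are referred to as
the essential coordinates.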
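Once the multi-indices of every local coordinate are known, the lemma reduces to arithmetic on rational numbers. The following sketch, assuming each chart is supplied as a pair of tuples (k, h), computes the minimum of (h_j + 1)/(2 k_j) over all charts and the largest number of indices attaining it in a single chart. The function name `rlct_from_charts` and the sample charts are hypothetical.

```python
from fractions import Fraction

def rlct_from_charts(charts):
    """Return (lambda, m) from a list of (k, h) multi-index pairs, one pair per local coordinate.

    lambda is the minimum over charts and indices j of (h_j + 1) / (2 k_j), where
    indices with k_j = 0 are skipped because their ratio is defined to be +infinity.
    m is the largest number of indices attaining lambda within a single chart.
    """
    per_chart = []
    for k, h in charts:
        ratios = [Fraction(hj + 1, 2 * kj) for kj, hj in zip(k, h) if kj > 0]
        if ratios:
            smallest = min(ratios)
            per_chart.append((smallest, ratios.count(smallest)))
    if not per_chart:
        raise ValueError("every chart has k = 0, so no finite ratio exists")
    lam = min(value for value, _ in per_chart)
    m = max(count for value, count in per_chart if value == lam)
    return lam, m

# A chart with K(g(u)) = u_1^2 u_2^2 and trivial Jacobian factor: k = (1, 1), h = (0, 0).
lam_nc, m_nc = rlct_from_charts([((1, 1), (0, 0))])

# A chart of the form u_1^2 with Jacobian factor u_1^3 in four variables, as arises
# after one blow-up of a sum of four squares: k = (1, 0, 0, 0), h = (3, 0, 0, 0).
lam_reg, m_reg = rlct_from_charts([((1, 0, 0, 0), (3, 0, 0, 0))])
```

The second chart gives λ = 2 = d/2 with d = 4 and m = 1, in agreement with the regular value quoted in Section 2, while the first gives λ = 1/2 with m = 2.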

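For a given real analytic function $K(w)$, there are infinitely many different
resolutions of singularities. However, $\lambda$ and $m$ do not depend on the pair $(U^*, g)$.
They are called birational invariants. By the definition of $K(w)$ in eq. (2), there
exists an $L^s(q)$-valued analytic function $a(x,u)$ on each local coordinate in $U^*$
such that

$$f(x, g(u)) = a(x,u)\,u^k$$

and $E_X[\,|a(X,u)|^2\,] = 2$. Therefore,

$$H_0(g(u)) = n\,u^{2k} - \sqrt{n}\,u^k\,\xi_n(u),$$

where

$$\xi_n(u) = \frac{1}{\sqrt{n}}\sum_{i=1}^{n} \Big\{ S_i \cdot a(X_i,u) + u^k - \frac{1}{2}|a(X_i,u)|^2 u^k \Big\}.$$

Then $E[\|\xi_n\|^{s/2}] < \infty$ and $E[\|\nabla\xi_n\|^{s/2}] < \infty$, because both $a(x,u)$ and $\nabla a(x,u)$
are $L^s(q)$-valued analytic functions, where $\|\nabla\xi_n\| = \max_j \sup_{u\in U} |\partial_j \xi_n(u)|$. The
expectation operator $E_u[\ ]$ on $U$ is defined so that it satisfies $E_w^\epsilon[F(w)] = E_u[F(g(u))]$. Then

$$D_1^\epsilon = nE_u[2u^{2k}],$$
$$D_2^\epsilon = nE_X[\,|E_u[a(X,u)u^k]|^2\,],$$
$$D_3^\epsilon = \sum_{i=1}^{n} E_u[\,|a(X_i,u)|^2 u^{2k}\,],$$
$$D_4^\epsilon = \sum_{i=1}^{n} |E_u[a(X_i,u)u^k]|^2,$$
$$D_5^\epsilon = E_u[\sqrt{n}\,\xi_n(u)\,u^k].$$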
Lemma 7. Let $s \ge 12$ and $0 < \delta < s/6 - 1$. For $j = 1, 2, 3, 4, 5$, there exists a
constant $C > 0$ such that $E[(D_j^\epsilon)^{1+\delta}] < C$ holds.

Proof. Since $0 \le D_2^\epsilon \le D_1^\epsilon$, $0 \le D_4^\epsilon \le D_3^\epsilon$, and $|D_5^\epsilon| \le (\|\xi_n\|^2 + 2D_1^\epsilon)/2$, it is
sufficient to prove the cases $j = 1, 3$. The proofs for $j = 1, 3$ can be done in the same way.
Let us prove the case $j = 3$. Among $l = 1, 2, \ldots, d$, at least one $k_l \ge 1$. By using
partial integration for $du_l$, we can show that there exists $c_1 > 0$ such that

$$E_u[u^{2k}] \le \frac{c_1}{n}\{1 + \|\xi_n\|^2 + \|\nabla\xi_n\|^2\}. \tag{13}$$

Therefore by using $L = (1/n)\sum_{i=1}^{n} \|a(X_i)\|^2$ and Hölder's inequality with $1/3 + 1/(3/2) = 1$,

$$E[(D_3^\epsilon)^{1+\delta}] \le E[(c_1 L(1 + \|\xi_n\|^2 + \|\nabla\xi_n\|^2))^{1+\delta}]$$

$$\le E[(c_1 L)^{3(1+\delta)}]^{1/3}\, E[(1 + \|\xi_n\|^2 + \|\nabla\xi_n\|^2)^{3(1+\delta)/2}]^{2/3}.$$

Since $E[\|a(X)\|^{s}] < \infty$, $E[\|\xi_n\|^{s/2}] < \infty$, and $E[\|\nabla\xi_n\|^{s/2}] < \infty$, this
expectation is finite. □



5 Renormalized distribution 

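Definition 5.1. For a given function $h(u)$ on $U$, the renormalized expectation
operator $E^*_{u,t}[\ \ |h]$ is defined by

$$E^*_{u,t}[F(u,t)\,|\,h] = \frac{\sum_{\alpha^*} \int_0^\infty dt \int D(du)\, F(u,t)\, t^{\lambda-1} e^{-\beta t + \beta\sqrt{t}\,h(u)}}{\sum_{\alpha^*} \int_0^\infty dt \int D(du)\, t^{\lambda-1} e^{-\beta t + \beta\sqrt{t}\,h(u)}},$$

where $D(du)$ is a measure which is defined in eq. (16) and $\sum_{\alpha^*}$ shows the sum
over all essential coordinates. Also we define

$$D_1^*(h) = E^*_{u,t}[2t\,|\,h],$$
$$D_2^*(h) = E_X[\,|E^*_{u,t}[a(X,u)\sqrt{t}\,|\,h]|^2\,],$$
$$D_5^*(h) = E^*_{u,t}[h(u)\sqrt{t}\,|\,h].$$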

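Lemma 8. The following convergences in probability hold.

$$D_1^\epsilon - D_1^*(\xi_n) \to 0,$$
$$D_2^\epsilon - D_2^*(\xi_n) \to 0,$$
$$D_3^\epsilon - D_1^*(\xi_n) \to 0,$$
$$D_4^\epsilon - D_2^*(\xi_n) \to 0,$$
$$D_5^\epsilon - D_5^*(\xi_n) \to 0.$$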
Proof. These five convergences can be proved in the same way. We show $D_3^\epsilon - D_1^*(\xi_n) \to 0$.
Let $L(u) = (1/n)\sum_{i=1}^{n} |a(X_i,u)|^2$. Since $E_X[\,|a(X,u)|^2\,] = 2$,

$$|D_3^\epsilon - D_1^*(\xi_n)| \le |E_u[nL(u)u^{2k}] - E_u[E_X[\,|a(X,u)|^2\,]\,n u^{2k}]| + |E_u[2nu^{2k}] - E^*_{u,t}[2t\,|\,\xi_n]|.$$

Let the first and second terms of the right hand side of this inequality be $D_6$ and
$D_7$ respectively. Then

$$D_6 \le \|L - E_X[\,|a(X)|^2\,]\|\; E_u[nu^{2k}].$$

By the convergence in probability $\|L - E_X[\,|a(X)|^2\,]\| \to 0$ and eq. (13), $D_6$ converges
to zero in probability. From Lemma 10 and 11 in the Appendix, it is derived that

mu''^]-ElAt\U\ < ,lj l + /^l|Vg„||}, (14) 

iognmin^^)-^ 

which shows $D_7 \to 0$ in probability. □

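Lemma 9. For arbitrary function $h(u)$, the following equality holds.

$$D_1^*(h) = D_5^*(h) + \frac{2\lambda}{\beta}.$$

Proof. Let $F_p(u)$ be a function defined by

$$F_p(u) = \int_0^\infty t^{\lambda+p-1} e^{-\beta t + \beta\sqrt{t}\,h(u)}\,dt.$$

Then by using the partial integration of $dt$,

$$F_1(u) = \frac{h(u)}{2}F_{1/2}(u) + \frac{\lambda}{\beta}F_0(u).$$

By the definition of $D_1^*(h) = E^*_{u,t}[2t\,|\,h]$ and $D_5^*(h) = E^*_{u,t}[h(u)\sqrt{t}\,|\,h]$, we obtain
the lemma. □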
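The integration-by-parts step can be checked numerically at a single point u. The sketch below assumes that F_p is the integral of t^(λ+p-1) exp(-βt + β√t h) over t > 0, which is our reading of the weight used in Definition 5.1. Under that assumption, differentiating t^λ exp(-βt + β√t h) and integrating gives F_1 = (h/2) F_{1/2} + (λ/β) F_0. The values of `lam`, `beta`, `h` and the helper `F` are arbitrary illustrative choices.

```python
import math

def F(p, lam, beta, h, lo=-80.0, hi=8.0, steps=88000):
    """Trapezoidal approximation of int_0^inf t^(lam+p-1) exp(-beta*t + beta*sqrt(t)*h) dt.

    The substitution t = exp(x) removes the singularity at t = 0 and gives an
    integrand in x that decays at both ends of [lo, hi].
    """
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * dx
        t = math.exp(x)
        value = math.exp((lam + p) * x - beta * t + beta * math.sqrt(t) * h)
        total += (0.5 if i in (0, steps) else 1.0) * value * dx
    return total

lam, beta, h = 0.5, 1.3, 0.8   # illustrative values only

F0 = F(0.0, lam, beta, h)
F_half = F(0.5, lam, beta, h)
F1 = F(1.0, lam, beta, h)

# Pointwise analogues of the two renormalized quantities in the lemma.
d1_star = 2.0 * F1 / F0        # corresponds to the average of 2t
d5_star = h * F_half / F0      # corresponds to the average of h * sqrt(t)
```

With these definitions `d1_star - d5_star` agrees with 2λ/β up to quadrature error, which is the pointwise content of the lemma before averaging over the measure D(du).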



6 Proof of Main Theorems 

6.1 Proof of Lemma 1

Proof. Lemma 1 is already proved in section 4. □

6.2 Proof of Lemma 2

Proof. By the definition, $V = D_3 - D_4$. By Lemma 5 and 7, $E[V^{1+\delta}] < \infty$.
Recall that the convergence in law $\xi_n \to \xi$ holds. The random variable $D_1^*(\xi_n) - D_2^*(\xi_n)$
is a continuous function of $\xi_n$, hence it converges to a random variable
$D_1^*(\xi) - D_2^*(\xi)$ in law. Therefore, by Lemma 5 and 8, $D_3 - D_4$ converges to
the same random variable in law. Hence $E[V]$ converges to a constant when $n$
tends to infinity. □
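6.3 Proof of Theorem 1

Proof. In the same way as the proof of Lemma 2, both $E[D_1]$ and $E[D_3]$ converge
to $E[D_1^*(\xi)]$ whereas both $E[D_2]$ and $E[D_4]$ converge to $E[D_2^*(\xi)]$. From eqs. (6),
(7), and (8),

$$E[n(G - S)] \to \frac{1}{2}E[D_2^*(\xi)],$$
$$E[n(T - S)] \to -2\sigma^2\nu + \frac{1}{2}E[D_2^*(\xi)],$$
$$E[V] \to E[D_1^*(\xi)] - E[D_2^*(\xi)],$$

where we used the definition of $\nu$, that is to say, $E[D_1^*(\xi) - D_2^*(\xi)] = 2\nu/\beta$.
From eq. (9) and Lemma 9,

$$E[D_5^*(\xi)] = 2\sigma^2\nu, \qquad E[D_1^*(\xi)] = E[D_5^*(\xi)] + \frac{2\lambda}{\beta},$$

resulting that

$$E[D_2^*(\xi)] = \frac{2(\lambda - \nu)}{\beta} + 2\sigma^2\nu,$$

which completes the theorem. □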

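6.4 Proof of Theorem 2

Proof. From Theorem 1,

$$E[G] = \frac{N\sigma^2}{2} + \frac{1}{n}\Big(\frac{\lambda - \nu}{\beta} + \nu\sigma^2\Big) + o_n,$$

$$E[T] = \frac{N\sigma^2}{2} + \frac{1}{n}\Big(\frac{\lambda - \nu}{\beta} - \nu\sigma^2\Big) + o_n,$$

where $n\,o_n \to 0$. Therefore, by eq. (3),

$$E[G] = E[T]\Big(1 + \frac{2\beta E[V]}{nN}\Big) + o_n.$$

To prove Theorem 2, it is sufficient to show $E[VT] - E[V]E[T] \to 0$.

$$|E[V(T - E[T])]| \le E[V^2]^{1/2}\, E[(T - E[T])^2]^{1/2}.$$

Since $s/4 - 1 \ge 2$,

$$0 \le E[V^2] \le E[(D_3)^2] < \infty.$$

Let $S^{(n)} = \frac{1}{n}\sum_{i=1}^{n} |S_i|^2/2$, $S = \sigma^2 N/2$. Then

$$E[(T - E[T])^2] \le 3E[(T - S^{(n)})^2 + (S^{(n)} - S)^2 + (S - E[T])^2].$$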
Firstly, from

rr. _ ain) _ E^[r]n{w)] D3 

we obtain 

n n 

which converges to zero. Secondly, $\{S_i\}$ are independently subject to the normal
distribution, hence $E[(S^{(n)} - S)^2] \to 0$. And lastly,

n 

hence $(E[T] - S)^2$ also converges to zero. □



7 Conclusion 

In this paper, we proved that the singular regression problem is mathematically
determined by two birational invariants, the real log canonical threshold and
the singular fluctuation. Moreover, there is a universal relation between the
generalization error and the training error, by which we can estimate the two birational
invariants from random samples.



Appendix 

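To prove eq. (14), we use the following lemmas. Let $\xi$ and $\varphi$ be functions of
class $C^1$ from $[0,b]^d$ to $\mathbb{R}$. Assume that $\varphi(u) > 0$, $u = (x, y) \in [0,b]^d$. The partition
function of $\xi$, $\varphi$, $n > 1$, and $p \ge 0$ is defined by

$$Z^p(n,\xi,\varphi) = \int_{[0,b]^m} dx \int_{[0,b]^{d-m}} dy\; K(x,y)^{2p}\, x^h y^{h'} \varphi(x,y)\, \exp\big(-n\beta K(x,y)^2 + \sqrt{n}\,\beta K(x,y)\,\xi(x,y)\big), \tag{15}$$

where $K(x,y) = x^k y^{k'}$. Let us use

$$\|\xi\| = \max_{(x,y)\in[0,b]^d} |\xi(x,y)|,$$

$$\|\nabla\xi\| = \max_{1\le j\le m}\ \max_{(x,y)\in[0,b]^d} \Big|\frac{\partial \xi}{\partial x_j}\Big|.$$

Without loss of generality, we can assume that four multi-indices $k, k', h, h'$
satisfy

$$\frac{h_1 + 1}{2k_1} = \cdots = \frac{h_m + 1}{2k_m} = \lambda < \frac{h'_j + 1}{2k'_j} \quad (j = 1, \ldots, d - m).$$

In this appendix, we define $a(n,p) = (\log n)^{m-1}/n^{\lambda+p}$.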
Lemma 10. There exist constants $c_1, c_2 > 0$ such that for arbitrary $\xi$ and $\varphi$
with $\varphi(u) > 0$ on $[0,b]^d$, and an arbitrary natural number $n > 1$,

$$c_1\, a(n,p)\, e^{-\beta\|\xi\|^2/2} \min(\varphi) \le Z^p(n,\xi,\varphi) \le c_2\, a(n,p)\, e^{\beta\|\xi\|^2/2}\, \|\varphi\|$$

holds, where $\min(\varphi) = \min_{u\in[0,b]^d} \varphi(u)$.

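Let $\xi$ and $\varphi$ be functions of class $C^1$. We define

$$Y^p(n,\xi,\varphi) = \gamma\, a(n,p) \int_0^\infty dt \int_{[0,b]^{d-m}} dy\; t^{\lambda+p-1}\, y^\mu\, e^{-\beta t + \beta\sqrt{t}\,\xi_0(y)}\, \varphi_0(y),$$

where we use notations, $\gamma = b^{|h|+m-2|k|\lambda}/(2^m (m-1)! \prod_{j=1}^{m} k_j)$, $\xi_0(y) = \xi(0,y)$,
$\varphi_0(y) = \varphi(0,y)$, and $\mu = h' - 2\lambda k'$. A measure $D(du)$ on $[0,b]^d$ is defined by

$$D(du) = \gamma\,\delta(x)\,y^\mu\,dx\,dy. \tag{16}$$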

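Lemma 11. There exists a constant $c_3 > 0$ such that, for arbitrary $n > 1$, $\xi$,
$\varphi$, and $p \ge 0$,

$$|Z^p(n,\xi,\varphi) - Y^p(n,\xi,\varphi)|$$

Moreover, there exist constants $c_4, c_5 > 0$ such that, for arbitrary $\xi$, $\varphi$, $n > 1$,

$$c_4\, a(n,p)\, e^{-\beta\|\xi\|^2/2} \min(\varphi) \le Y^p(n,\xi,\varphi) \le c_5\, a(n,p)\, e^{\beta\|\xi\|^2/2}\, \|\varphi\|.$$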

Proof. Lemmas 10 and 11 are proved by direct but rather complicated
calculations [miTS]. Let us introduce the outline of the proof. Let $F_p(x,y)$ be the
integrated function in eq. (15) and $Z^p = Z^p(n,\xi,\varphi)$. Then

$$Z^p = \int dx \int dy\; F_p(x,y),$$

which is equal to

$$Z^p = \int_0^\infty dt \int_{[0,b]^d} dx\,dy\; \delta(t - K(x,y)^2)\, F_p(x,y). \tag{17}$$

Therefore, the problem reduces to the study of $\delta(t - K(x,y)^2)$. For an arbitrary function $\psi(x,y)$
of class $C^\infty$, the function

$$\zeta(z) = \int K(x,y)^{2z}\, \psi(x,y)\,dx\,dy$$

is a meromorphic function whose poles are $(-\lambda_j)$ with orders $m_j$, hence it
has a Laurent expansion.
In this expansion, $\zeta_0(z)$ is a holomorphic function and each coefficient $c_{jm}(\psi)$ is a Schwartz distribution.
Since $\int \delta(t - K(x,y)^2)\,\psi(x,y)\,dx\,dy$ is the Mellin transform of $\zeta(z)$, we have an
asymptotic expansion of $\delta(t - K(x,y)^2)$ for $t \to +0$,

$$\delta(t - K(x,y)^2) \cong \sum_{j=1}^{\infty} \sum_{m=1}^{m_j} t^{\lambda_j - 1} (-\log t)^{m-1}\, c_{jm}(x,y),$$

where $c_{jm}(x,y)$ is a Schwartz distribution. By applying this expansion to
eq. (17), we obtain the two lemmas. □